Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire
نویسندگان
چکیده
Speaker change detection is an important task in multi-party interactions such as meetings and conversations. In this paper, we address the speaker from perspective of sequence transduction. Specifically, propose a novel encoder-decoder framework that directly converts input feature to identity sequence. The difference-based continuous integrate-and-fire mechanism designed support framework. It detects changes by integrating difference between encoder outputs frame-by-frame transfers segment-level embeddings according detected changes. whole supervised sequence, weaker label than precise points. experiments on AMI DIHARD-I corpora show our sequence-level method consistently outperforms strong frame-level baseline uses labels.
منابع مشابه
Speaker Change Detection based on Mean Shift
To settle out the problem that search of speaker change point (SCP) is blind and exhaustive, mean shift is proposed to seek SCP by estimating the kernel density of speech stream in this paper. It contains three steps: seeking peak points using mean shift firstly, using maximum likelihood ratio (MLR) to compute the MLR value of the peak points secondly, and seeking SCPs from MLR value using the ...
متن کاملConductance-Based Integrate and Fire Models
A conductance-based model of Na+ and K+ currents underlying action potential generation is introduced by simplifying the quantitative model of Hodgkin and Huxley (HH). If the time course of rate constants can be approximated by a pulse, HH equations can be solved analytically. Pulse-based (PB) models generate action potentials very similar to the HH model but are computationally faster. Unlike ...
متن کاملIntegrate and Fire Neurons
SpikeNET is a simulator for modeling large networks of asynchronously spiking neurons. It uses simple integrate-and-fire neurons which undergo step-like changes in membrane potential when synaptic inputs arrive. If a threshold is exceeded, the potential is reset and the neuron added to a list to be propagated on the next time step. Using such spike lists greatly reduces the computations associa...
متن کاملOn-line incremental speaker adaptation with automatic speaker change detection
In order to improve the performance of speech recognition systems when speakers change frequently and each of them utters a series of several sentences, a new unsupervised, online and incremental speaker adaptation technique combined with automatic detection of speaker changes is proposed. The speaker change is detected by comparing likelihoods using speaker-independent and speaker-adaptive GMM...
متن کاملChange detection from satellite images based on optimal asymmetric thresholding the difference image
As a process to detect changes in land cover by using multi-temporal satellite images, change detection is one of the practical subjects in field of remote sensing. Any progress on this issue increase the accuracy of results as well as facilitating and accelerating the analysis of multi-temporal data and reducing the cost of producing geospatial information. In this study, an unsupervised chang...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Signal Processing Letters
سال: 2022
ISSN: ['1558-2361', '1070-9908']
DOI: https://doi.org/10.1109/lsp.2022.3185955